Learning Formulation and Transformation Rules for Multilingual Named Entities

نویسندگان

Hsin-Hsi Chen

Changhua Yang

Ying Lin

چکیده

This paper investigates three multilingual named entity corpora, including named people, named locations and named organizations. Frequency-based approaches with and without dictionary are proposed to extract formulation rules of named entities for individual languages, and transformation rules for mapping among languages. We consider the issues of abbreviation and compound keyword at a distance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Translating-transliterating named entities for multilingual information access

monolingual named entities. Extending them to multilingual entities is becoming important because a large amount of multilingual materials are generated and disseminated over the Web. The fundamental issues in processing multilingual named entities are recognizing them and finding their correspondence. Embedded technologies include learning formulation and transformation rules for multilingual ...

متن کامل

بهبود شناسایی موجودیت‌های نامدار فارسی با استفاده از کسره اضافه

Named entity recognition is a process in which the people’s names, name of places (cities, countries, seas, etc.) and organizations (public and private companies, international institutions, etc.), date, currency and percentages in a text are identified. Named entity recognition plays an important role in many NLP tasks such as semantic role labeling, question answering, summarization, machine ...

متن کامل

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features

Named Entity Recognition is an information extraction technique that identifies name entities in a text. Three popular methods have been conventionally used namely: rule-based, machine-learning-based and hybrid of them to extract named entities from a text. Machine-learning-based methods have good performance in the Persian language if they are trained with good features. To get good performanc...

متن کامل

A Multilingual Entity Linker Using PageRank and Semantic Graphs

This paper describes HERD, a multilingual named entity recognizer and linker. HERD is based on the links in Wikipedia to resolve mappings between the entities and their different names, and Wikidata as a language-agnostic reference of entity identifiers. HERD extracts the mentions from text using a string matching engine and links them to entities with a combination of rules, PageRank, and feat...

متن کامل

Invited Talk: Multilingual Named Entity Recognition

The computational research aiming at automatically identifying named entities (NE) in texts forms a vast and heterogeneous pool of strategies, techniques and representations from hand-crafted rules towards machine learning approaches. Hand-crafted rule based systems provide good performance at a relatively high system engineering cost. The availability of a large collection of annotated data is...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2003

Learning Formulation and Transformation Rules for Multilingual Named Entities

نویسندگان

چکیده

منابع مشابه

Translating-transliterating named entities for multilingual information access

بهبود شناسایی موجودیت‌های نامدار فارسی با استفاده از کسره اضافه

A Novel Approach to Conditional Random Field-based Named Entity Recognition using Persian Specific Features

A Multilingual Entity Linker Using PageRank and Semantic Graphs

Invited Talk: Multilingual Named Entity Recognition

عنوان ژورنال:

اشتراک گذاری